Fractal Dimension Analysis of Dna Sequences
نویسنده
چکیده
This paper presents fractal measures for deoxyribonucleic acid (DNA) sequences using fractal and multifractal methodologies ([Kins94] and [RiKi99]), and demonstrates the intriguing possibility of identifying biological functionality using information contained within the DNA sequence. A vast amount of genetic material has been sequenced to date with an exponential growth rate. By 2005, human genome in a total of 3 x 109 base pairs will have been sequenced. But the rate of finding genes and their location at chromatosomes is relatively slow. Therefore, robust computational solutions to the DNA sequences analysis are a valuable way to the molecular biology. Most of the current research in the deciphering the meaning of DNA sequences is approached from the lowest level. Its main objective is to search for patterns or correlations existing in the DNA sequence related to codons, amino acids, and proteins using a variety of sophisticated computational techniques, including neural network algorithms, dynamic programming, rule-based methods, decision trees, probabilistic reasoning, and hidden Markov chain. Although none of them perform perfectly, the information they provide is so valuable that some of them are even used for recognition putative genes in uncharacterized DNA. In this paper, we have developed techniques that: (1) transform the DNA sequence alphabets to signal spectrum based on the Lévy-walk [BeRO96]; (2) estimate single fractal and multifractal dimensions; (3) power spectrum analysis. Experimental results show that (i) DNA is multifractal, and that the multifractality changes depending upon the location (coding or noncoding region) in the sequence; (ii) local polymer dimension fluctuated with the variation of the local DNA sequence; and (iii) there is a significant difference between coding region and noncoding region with DNA sequence power spectrum and the β value of them is around -1. AcknowledgmentsThis research has been supported in part by the Natural Sciences and Engineering Council (NSERC) of Canada. References[BeRO96] P. Bernaola-Galvan, R. Roman-Roldan, and J.L. Oliver, “Compositional seqmentation and long-rangefractal correlations in DNA sequences,” Phys. Rev. E, vol. 53, pp. 5181-5189, 1996.[Kins94] W. Kinsner, “Fractal dimensions: Morphological, entropy, spectrum, and variance classes,” TechnicalReport, DEL94-5, University of Manitoba, May 1994, 146 pp.[RiKi99] R. Rifaat and W. Kinsner, “Multifractal analysis of DNA sequences,” Proc. of IEEE CCECE’99,Edmonton, Canada, pp. 801-804,1999. ADVISOR: W. Kinsner, Ph. D., P. Eng.
منابع مشابه
Feature Extraction from Dna Sequences by Multifractal Analysis
This paper presents feature extraction and estimation of multifractal measures of DNA sequences, using a multifractal methodology, and demonstrates a new scheme for identifying biological functionality, using information contained within the DNA sequences. It shows that the Rényi and Mandelbrot fractal dimension spectra may be useful techniques for extracting the information contained in the DN...
متن کاملThe Application of fractal dimension and morphometric properties of drainage networks in the analysis of formation sensibility in arid areas (Case Study, Yazd-Ardakan Basin)
Introduction: Many natural phenomena have many variables that make it difficult to find relationships between them using common mathematical methods. This problem, along with the impossibility of measuring all elements of nature, has led to a major evolution in the way of understanding and explaining phenomena. In this way, one can use the fractal geometry with the theory that many natural phen...
متن کاملAn Efficient Algorithm for Measuring Fractal Dimension of Complex Sequences
The paper presents algorithms for measuring the fractal dimension of real and complex digital signals embedded in one, two or higher dimensional spaces. Simple, yet efficient MATLAB implementations, allowing the fast handling of whole chromosome complex genomic signals, are described. Both global and local fractal dimension measurements are performed, as required by the fractal structure analys...
متن کاملFractal Dimension of Human Chromosome 22
Based on complex genomic signal analysis, it has recently been reported that DNA sequences show large-scale regularities, at the scale of whole chromosomes. The paper shows that these long range correlations can also be found in a fractal-like structure of DNA genomic signals, as well as in their 1/f noise. Specifically, the paper studies the fractal dimension of DNA segments along homo sapiens...
متن کاملDiagnosis of Lung Cancer by Fractal Analysis of Damaged DNA
Cancer starts when cells in a part of the body start to grow out of control. In fact cells become cancer cells because of DNA damage. A DNA walk of a genome represents how the frequency of each nucleotide of a pairing nucleotide couple changes locally. In this research in order to study the cancer genes, DNA walk plots of genomes of patients with lung cancer were generated using a program writt...
متن کاملMulti-Fractal Analysis for Feature Extraction from DNA Sequences
This paper presents estimations of multi-scale (multi-fractal) measures for feature extraction from deoxyribonucleic acid (DNA) sequences, and demonstrates the intriguing possibility of identifying biological functionality using information contained within the DNA sequence. We have developed a technique that seeks patterns or correlations in the DNA sequence at a higher level than the local ba...
متن کامل